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DECISION MAKING AND PROHLEM SOLVING WITH COMPUTER ASSISTANCE 


P. Kraiss 

Research Institute for An thro po technology 
Meokenheim, West Germany 


In this report the results achieved in a research project of 
the BMVg on the subject of "Man-Computer Interaction in Guidanoe 
and Control Systems" (T/RF 36 / 60024 / 61606) are summarized. 

In the period from February to November 1977 the author worked at 
the NASA Ames Researoh Center, Moffett Field, California in the 
Man-Machine Integration Researoh Division to conduct work for this 
project, 

Numerous contributions from the research projects being con- 
ducted there and from many discussions with American colleagues 
find their echo in this report. Various important bibliographical 
sources first became accessible there, as did participation in 
internal courses and activities. 


I extend a hearty thanks to the NASA Ames Researoh Center 
for these services and for the excellent working conditions, 

Mr. Klaus-Peter Holzhausen and Mr, Willi Stein, both engineers 
have taken the time and effort to review this report oritically. 

I also wish to extend my thanks to them, 

1 , Introduction ^ 

Modern guidance and control systems serve for the control, 
coordination and communication in complex technical systems. 

The tasks delegated to the operator in his changing roles as 
manager, supervisor, problem solver, trouble shooter, etc, 
essentially require the following mental capabilities: 

observation (selection of data relevant to the task) 

- correction 

- decision making 

problem solving (finding the procedure for the solution 
and effectively representing the problem) 

- association (analogous conclusions) 

- deduction (understanding relationships, solving multiple 

meanings via comparisons with stored knowledge and 
experience). 

This mental work must often be conducted under extreme work 
loads and pressure of time. As a rule large amounts of data have 
to be recorded and processed. Portions of these data contain mis- 
takes, Changed aims and boundary conditions must continually be 
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taken into consideration 


Prom this description of tasks it oan be seen that the mental 
stress on tha operator is very large in normal situations. On the 
other hand the work is facilitated by the fact that the working 
method of operator is already programmed to a great extent, i.e., 
the same work sequenoes and problem types occur and reooour. 

The prooess of decision making and problem solving in a situ- 
ation typical in guidance and control systems may be divided into 
the following areas . £1] : 

- Does a decision have to be made or is a problem to be solved? 

- Observation of the prooess and surrounding situation 

- Selection of data relevant to the decision 

- Interpretation and reduction of data 

- Does the system condition observed deviate from 
expectations provided by internal models (experience, 
training)? 

— Is this a new problem? a 

- Is a routine solution possible? 

- May the problem be divided into partial problems with 
known solutions? 

- Has this situation already occurred before? What deci- 
sion was made then and with what degree of sucoess? 

- Which paths of solution or alternatives for dealing with 
the problem are there? 

- Selection of a strategy (conservative, risky) 

- Is additional information necessary? What would this 
cost? 

- Test and evaluation of possible consequenoes 

- How useful will the solution be? 

- Is the occurrence of the expected consequence certain 
or only probable? 

- How great is the risk of an undesired consequence? 

- How could an unfavorable consequence be met? (Calculating 
steps in advance) 

- Carrying out the action with the greatest usefulness 
expected, that is with the least disadvantage. 

- Observation of actual effects. 


Experience teaches that a man cannot optimally conduct this 
complex sequence of decision making. He regularly deviates from 
behavior predicted on the basis of normative models, reasons being 
erroneous information processing and storing in the brain, lessening 
concentration, prejudice and inconsistant behavior. In addition a 
certain degree of uncertainty exists for the operator with respect 
to available alternatives, possible consequences, surrounding and 
boundary conditions and alsu individual preferences . 


The oresent report studies the possibilities for aotively 
supporting the operator in carrying out his tasks with guidanoe 
and oontrol systems. 

The guiding thought in this project is to reduoe human error 
by means of oomputer support, to employ a oomputer for supporting 
operator intelligence. 

A certain degree of intelligence (Artificial Intelligence, 

A. I.) is attributed to oomputer programs more or less oapable of 
reproducing the above-mentioned mental performance of the operator 
[3j • Considerable progress has been made in this area in the past 
few years, especially with respect to pattern recognition and classi- 
fication. This has led to the application of fully automated robots 
in manufacturing processes, which previously only men oould oarry 
out. 


The transition to full automation, however, is substantially 
more problematic in guidanoe and oontrol systems than in industrial 
procedures, one reason being that the available maohines are not 
yet intelligent enough for many tasks. In addition it must be taken 
into consideration that the operator has to have the last word in 
many situations. It is, for example, difficult to imagine a fully 
automated air traffic guidance system without human oontrol 

Therefore the replacement of man by intelligent maohines is 
not to be discussed here, but on the contrary a symbiotic rela- 
tionship between man and computer, in whioh the computer assumes 
the role of a partner, an intelligent associate. In this man- 
computer team each partner continuously assumes that portion of 
the total task fitting his powers and capabilities. 

Notice that this approach substantially differs from a partial 
automation in which only routine work is delegated to the computer. 

In automated systems the operator only takes part sporadically, but 
does not receive any help from the computer in decision making in 
the usually critioal situations. 

It will be 'urther demonstrated that intelligent support of 
the operator in interaction with him is possible on various levels 
of the decision making process. The applied artificial intelligence 
algorithms comprehend pattern recognition and classification, 
adaptation and machine learning as well as dynamic and heuristic /6 
programming. Elementary examples are presented to explain basic 
principles of individual procedures. 

Consequences for design of future man-machine-systems, re- 
sulting from a cooperation between human and artificial intelli- 
gence, are discussed, 

2. Cooperation with Computers capable of Learning and Adapting 

A graphic representation of possibilities for the cooperation 
between humans and intelligent machines is presented in Figure 1. 
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Aside from the known man-maohine controlled oirouit a computer is 
presented here arranged parallelly to the opera tor • 

Since the computer is first untrained, the machine must re- 
ceive information about the intelligent behavior expeoted. This 
oan be oarried out explicitly by programming normative models for 
desired operator behavior or by presenting aims or strategies. 

An entire series of mathematical models is available for this pur- 
pose, e.g. for reading instruments, discovering signals, prooessing 
information and making decisions (theory of usefulness, analysis 
of deoisions). Rouse gives a survey of the development of such 
models [ 5 ] • 

Another implioit possibility for instructing the computer 
consists in constructing an internal oomputer model oapable of 
adapting by means of continuous observation of operator behavior. 

This behavior is expressed in the aotions taker, for an observed 
situation. In time a descriptive model of ever increasing reliability 
is created, ideally mirroring operator preferences adequately. Suoh 
models may therefore be employed for simulating humans and predicting 
what they intend to do. Building such a model is only possible if the 
operator behaves consistently for an extended period of time. This 
means that he makes comparable deoisions in identical situations and 
retains his strategy. 

Once the computer is trained for a defined iuix-ier of behavior, /2 
cooperation oetween human and oomputer may be oarried out on various 
levels: the computer is relegated to an advisory position, i.e. with- 
out independent responsibility it merely automatically indicates 
suggestions for dealing with problems. This may be applied, e.g, 
to the training of an operator along the guidelines of a normative 
model. Of course it is left to the discretion of the operator to 
call for further information from the oemputer. Among other things 
the following would be interesting for a pilot, "Which measures are 
to be oarried out, if an engines fails in the next four minutes." 

On the other hand the computer may also direct questions to the 
operator if it determines that additional information is lacking for 
the calculation of internal algorithms (dialogue initiated by oom- 
puter). 

The computer advances from an advisory function to a line 
function with independent responsibility when its suggestions are 
not only indicated to the operator but are directly oarried out. 

The important question here is the division of tasks between man 
and machine and the interactive control. The various logical levels 
are expressed in the square "Task delegation" (Fig, 1): 

- Where required manual delegation of partial tasks to the 
computer for autonomous processing. 

- Computer independently assumes task portions when it 
determines that its internal model functions with a 
sufficient degree of reliability. 

- Computer waits and only goes into action when the operator 
shows no reaction. 


4 


- Computer blooke doubtful operator deoieione and gires the 
operator an alarm signal In oaaea of oonfliot. 

The following eeotiona of this report desoribe individual 
algorithms and prooedures applied to instruot the computer in 
desired intelligent behavior and to organize oooperation between 
man and maohine. 

Possibilities for interaction in processing are studied for ]£ 
the following tasks: 

- preparation and evaluation of information (Chapter 3) 

- deoision making (Chapter 4) 

- problem solving (Chapter 5) 

3. Computer Support in Preparation and Evaluation of Information 

The derivation of probabilities for oertain hypotheses under 
consideration of additional new information is a task frequently 
occurring in the process of deoision making* 

The influence of new data on the probability of an hypothesis 
is determined mathematically according to the rule of Bayes. This 
provides a normative model, a performance measurement for how well 
or poorly a man may estimate the oonsequenoe and significance of new 
information. 

3 .1 The Rule of Bayes [6] 

It is assumed that there are two hypotheses H. and H 2 , each 
excluding the other and also complementary. Then the valid formula 
for the a priori probability is: 

p(H 1 ) ♦ p(H 2 ) - 1 

The relationship between the beginning probability for a hypo- 
thesis and the subuequent probability dependent on an observation 
D is given by the rule of Bayes: 

, ( H. i d) - trnmvji , , D) . mjutm . 


The sum of these probabilities may also be set equal to 1: p(D) 
may be substituted in the above equation: 


>(//, | D) 


p(D | H,)p(H t ) 

p[D I //,)>(//,) + P(D I //,)>(//,) ' 


This equation describes the influence of additional information D /10 
on the probability of am hypothesis. The subsequent dependent 
probability of an hypothesis can therefore be calculated if the 
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probability for the ooourrenoe of a defined bit of information 
may be estimated under the assumption that the hypothesis is 
oorreot (p(DIH)). 

A generalisation of the above desoribed relationship for more 
than two (i) hypotheses and n repeated applications of the Bayes' 
Rule (several observations) results in the equation for independent 
observations and exclusive and all-enoompaesing hypotheses (£]: 


P(H t | D | ( Oj D m ) 


m **>• I "«)>.- .W) 
£,P(D.\ 


3,2 An Kxanple for the Revision of i robabilities accordin'; to :;ayeu 

A whale and ~ submarine have been sighted in an area of the 
ooean. The location of the submarine is to be determined. Two 
sensors are available for the searoh: 


Type of Sensor Charaoteristios Reliability Cost 


Sonar Booy rcaots to objects, 

does not differenti- 
ate between whale 
and submarine 

Helioopter reacts only to 

submarine 


low 


very 

high 


The ooean area is divided into four quadrants for the searoh 

(compare page 13, upper section). It is assumed in this example 

that sensors applied cover each entire quadrant. For the positions 

of submarine and whale there are first a priori probabilities of 

equal weight p rt (H„ ), p.(H w ) ■ 0.25, whioh exclude each other and 
o o 

therefore add \ip to 1 s 



The search begins when a sonar buoy is positioned randomly in /1 1 
quadrant 2. A signal is given that an object is positioned there 
(whale or submarine). The reliability (probability) of this find- 
ing is 0.7, resulting in revised probabilities for the above-defined 
hypotheses: 
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... . _ 0,7 * 0,25 0 7 

P(H I DI) - 5(T T-5 ( a + 6,y: + STTn^Ji'+TSTHSTB ‘ ti~ 0 '* 3 


p(H u' D,) ’ 0,3 • 67}ri6,7 • 6,'ls*6,i . a, Si + 6, 3 - d.H ’ 


0,3 


p(H. ion ■ p(H 10 . 1 ) 


The sane revised values result for the hypotheses with respeot 
to the assumed position of the whale, p(Dl IH U ) - 0,7 is therefore 

the probability for the ooourrenoe of observation D1 under the 
assumption that the hypothesis (submarine is in quadrant 2) is 
oorreot, u 2 

The next step is to searoh quadrant 2 with the helicopter, 
to asoertain whether the object positioned there is a submarine. 
Negative results are obtained. Reliability of this measurement 
is 90 resulting in the following revised pro babili ties * 



The values for the position of the whale are not affected by this 
sensor report. It is then determined by a sonar buoy that there 
is an object in quadrant 1. This determination also has an error 
quotient of 30 resulting in: 


P(H^I D3) 


0,7 . 0,31 

0,7 . 0,31 + 0,3 • 0,08 + 0,3 • 0,31+ 0,3 . 0,31 


= M1Z 

0,405 


0,53 


p(H I D3) 
u 2 

p(H I 03) 
W 1 


0,05 ; p (H ID3) = p(H I D3) = 0,23 
°3 u 4 

Q» 7 * °> W s ^ o,35 

0,7 • 0,19+0,3 • 0,43 + 0,3 • 0,19 + 0,3 • 0,19 0,376 


p(H I D3) 
2 


= 0,34; p(H 103) = p(H I D3) = 0,15 
w 3 4 


Since there are a total of only two objects and these are pro- /1 2 

bably in quadrants 1 and 2, it is now logical to search quadrant 1 
with the helicopter for the submarine. A signal is given that a 
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Probability for tho Poaition of 





I 

i 


Starting situation and 
a priori probabilities 
for the hypotheses sst 
down 


1 . 


Sonar Buoy positioned! 
Information gained! Objeot in 

quadrant 2. 
70 £ certain 


2, Helioopter employed: 

Information gained: No sub- 
marine in 
quadrant 2, 
90 H certain 


3. 


Sonar Buoy positioned: 
Information gained: Object in 

quadrant 1 , 
70 i» oertain 


4* Helioopter sent to quadrant 1: 
Information gained: Submarine in 

quadrant 1 . 

90 £ oertain. 


Submarine Whale 


1 

.25 

2 

.25 

.25 

.25 


4 .25 

.25 

.25 



J 9 

.43 

.19 

.43 

.19 

.19 

.19 

.19 




.31 

.08 

.31 

.31 

.53 

• 

.05 

.35 

.34 

.23 

.23 

.15 

.15 


.91 

.0 

.04 

.04 


Table 1 : Development of Probability Revision for the Example 

Submarine Looatlon. 




• • ■ -- - ... . » ta 
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submarine was found thors with a probability of 90 i», resulting in: 


p(H 


041 * 6,9 • 0 ,M + 0 ,TV ! 5 or I ^Vr- 4,53 + 6,1 . 6,JJ ■ S^~ 0 ' 9 ' 


p(H I D4) • 0| p(H |D4) - p(H I D4) ~0,04 



r 


The eubmarlne is therefore positioned in quadrant 1 with a probabil- 
ity of 91 Of course It depends on the strategy seleoted with 

what degree of certainty the reoonnoissanoe is to be carried out, 
how muoh the search may oost and when the searoh is to be discon- 
tinued. 

The development of probability revision for the positioning 
example is clearly summarized in Table 1 , The reader should examine 
whether he would have come to the same oonolusion subjectively. 

Note also that the effect of an additional bit of information 
on the probability of individual hypotheses depends on the number 
of a priori hypotheses. The positioning of a sonar buoy, having a 
reliability of 70 S, increases e,g, in the case of a positive signal 
the probability of an hypothesis "submarine present" to the following 
values 


from 

0.5 

to 

0.7 

in 

the 

case 

of 

2 

a priori hypotheses 

from 

0.25 

to 

0.43 

in 

the 

oase 

of 

4 

a priori hypotheses 

from 

0.1 

to 

0.21 

in 

the 

case 

of 

10 a priori hypotheses 

from 

0.05 

to 

0.11 

in 

the 

case 

of 

20 

a priori hypotheses. 


3.3 The Probabilistic Information Processor of Edwards 111 

Numerous experiments demonstrate [6] , that men usually do 
not correotly estimate the significance of new additional inform- 
ation, Often a conservative behavior and a tendenoy to avoid 
extreme estimated values is observed. In general a considerable 
deviation from the values determined "oorrect" according to the 
Bayes' Rule is to be expected. 

The probabilistic information processor (PIP) ofEdwards (1963) 
i 3 based on this observation [6,7], The basic idea for this pro- 
cedure is to have the Bayes' arithmetic processed by the computer 
while men still construct the basis for the a priori probabilities 
and for reliability of additional information (Pig, 2;. 

The function of this arrangement may be described as follows: 
on the lowest level information is collected, sorted and selected. 

On the next level experts attribute the individual data a signifi- 
cance in the form ptD^IH^, an expression for the probability of 
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Fig. 2: Probabilistic Information Processor of Edwards (from [ 6 ]) 

information under the assumption that hypothesis H,, 

A computer (Bayes' Processor) carries out the revision for 
probability of a priori hypotheses with these data* On the basis 
of these revised probabilities other experts then make decisions, Li 
that is, formulate alternative a priori hypotheses* 

Numerous studies show that the PIP system is a considerable 
aid in the deoision making prooess* One example is given in Fig. 3 
without a more exaot description of the basio situation* The de- 
viation of broken lines from the unbroken ourve shows how muoh 
human error oocurs in probability revision and how conservative 
the deoisions are in eaoh case. 



Fig. 3: Posterior Probabilities for the Correctness of an 

Hypothesis Estimated Subjectively ( ) and Oaloulated 

with the Aid of PIP ( ) [22] 


11 



It should be noted that the prooedure desorlbed Is only 
suitable for processing of relatively neutral teohnioal questions 
in whioh the alternatives are well defined. 

4. Computer Support in Deoision Making /1 6 

4.1 Decision 'Types and Usefulness 

A one-dimensional deoision situation is characterised by al- 
ternatives A^ and by surrounding situations B^, When the alterna- 
tive A^ is selected, while the surrounding conditions ara B^, the 
oonsequenoe is C^. Thi> situation is demonstrated in the graphic 
of the matrix (Pig. 4). 


Graph of a deoision making 
situation: when A i is chosen 

at surrounding conditions B^ 

the consequence is 

(from [6) ) 

a. alternatives 

b. surrounding situation 

c. consequences. 


depends on the makeup of 
surrounding conditions B^. A differentiation is made: 

a. Decisions which are certain or without risk, when the sur- 
rounding conditions are known. 

b. Decisions entailing some risks when for B. only probabilities 

may b^ given. J 

This is also termed the "game against nature " (the sum of 
all p(B, ) is 1 ). 

c. Uncertain decisions, where for B^ no probabilities are known. 

d. Games where B^ is determined by another person. The players 

are opponents where the victory of one is equal to the loss 
of the other (games with a resulting sum of zero). 

e. Dynamic decisions where a series of decisions is to be made 
and the matrix values A,B,C of individual decisions are in- 
fluences by previously made decisions. Where no such influence 
occurs, the corresponding terminology is static decision. 

Making a decision means, for all described decision types, 
selecting the consequence with the greatest advantage from several. 

The "usefulness" of a decision is a numerical index indicating how /1 7 





Pig. 4: 


Key: 


Classification of decision types 



strong the preferenoe of a certain person la for »nis oonsequence. 

The determination of usefulness index and the description and 
construction of models of human decision making behavior is subject 
of the decision analysis. The decision theory (Sheridan, Ferrell [6] 
Edwards, Tversky (fcj , Kaufmann [9] ) supplies normative, prescriptive 
models as aids in decision making and descriptive dmpirioal models 
which oan be tested for the description of an observed decision 
making behavior. 

Within the structure of the decision making theory it is ne- 
oessary to make certain plausible assumptions on how the prefer- 
ences of a decision maker evolve. A simple usefulness struoture. 
also employed in the following examples, may be based, for example, 
on the following assumptions: 

- There must be several consequences C^j. Formulation of these 

oonsequenoes in often difficult oinoe the objectively identi- 
cal facto may be subjectively evaluated differently. 

- When comparing naira of two oonsequenoes x,y from G, either 
x is preferred (xPy) or y (yPx) or neither of the two (xly), 

(|- indifferent). 

- For each consequence x there is a usefulness funotion EU(x). 1 ^ 

- If xPy, then EU(x) would be greater than EU(y), 

The preferences remain constant in time. 

The usefulness of multiple dimensional surrounding conditions 
may be determined by the usefulness of the elements. This means 
that the funotion of usefulness must make an Interval scale 
possible. 

The usefulness of a determined alternative may be oompared for 
various persons. This assumption makes it possible to oombine 
the judgements of various persons. 

Usefulness struoture is transitive, i.e. if xPy and yPz, then 
x is also preferred to the alternative z (xPz), 

Here is a brief example for dealing with an usefulness function 
EU: in the case of uncertain decisions only the probability of a 
consequence is known, as was already explained. Under certain 
weather conditions (surrounding conditions 3^) it will, for example, 

rain with a probability and with a probability of p s » 1 - p^ 

the sun will shine. Assumed that a person has to deoide under 
these weather conditions whether to take a rainooat or not, then 
each of the four logioal alternatives in this lottery is apparently 
connected to a subjective usefulness expectation, as shown in ths 
following matrix: 


u 


Head: Expected Usefulness of Consequence x 



Rain 

Sun 

Coat 

eo ko 

EU SC 

No Coat 

eu rn 

^SH 


A simple model for calculation of the total expeoted usefulness 
for action alternatives consists in adding the usefulness weighted 
by probability of occurrence of all possible consequenoes of an 
action: 

EU Coat “ EU RC x PR + EU SC x p S 

EU No Goat " blU RN x PR + EU SN X P S 

4.2 Static Deolslon Making Aids 

A method for determining the preferences of a person is to 
play through all possible decision situations. The decision made 
may then be registered and stored in a computer which supplies the 
prepared solution for every decision to be made. This procedure 
has no praotical application when the deoision is based on sur- 
rounding conditions characterized by an entire series of faotors. 

A more rational method for determining a computer algorithm 
for evaluation of multiple dimensional alternatives is indicated 
by Yntema and Klem [lOJ. The procedure is demonstrated in the 
following simplified example of a certain decision: the decision /1 9 
to be made consists in the selection of one of two nearby airports 
for an emergency landing. The surrounding situation is described 
by the parameters cloud height, visibility and fuel reserves, for 
which in each case 5 individual values are assumed. (Fig. 5). One- 
hundred- twenty-five different surrounding conditions result with 
this assumption. 


Level 

Cloud Height 

Visibility 

Fuel Reserves 

1 

100 

1/4 

15 

2 

300 

1/2 

30 

3 

1000 

1 

60 

4 

2000 

2 

120 

5 

5000 

5 

250 


Fig. 5: Surrounding Conditions for the Decision Making 
Task described in the Text (Yntema, Klem 0oj) 

The "correct” aiiport for all situations could be experi- 
mentally determined by presenting experienced pilots with all 
possible parameter combinations for airports 1 and 2 in pairs. 
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The process would be so complicated that this method cannot be 
ourried out praotioally. The alternative procedure of Ynteraa and 
Klem 00] consists in having experienced pilots determine working 
values in the form of an interval soals for the decisive values 
of individual surrounding parameter (see Pig, 6. left-hand side). 

Then the individual parameter are soaled (see Pig, 6, right). Scale 
x (cloud height), y (visibility) and z (fuel reserves) thus result. 
The expeoted usefulness (EU) which a oertai i parameter structure 
has fox the decision to be made may be determined for each individual 
parameter by linear interpolation of the corresponding soale, for 
example: 

BU(x) - C 1 * C 2 x (* ■> 



Basic Value Soale Scales for Individual Parameters 


B 


K>0 ft, 5 ml, IS (jot A 


100 ft, S ml, *40 gaM 


9000 ft, 9 mi, 290 got A 


- 100 ft, l/4 mi, 19 fat 



- 900011, 1/4 mi, 19 gel 

■ 100 It 

- 1/4 m> 

- 100 ft, 1/4 mi, 290 gel 

• 

■ 300ft 

. l/2 mi 

- 9000 ft, 1/4 mi, 290 gal 

■ 1000 ft 

- 1 mi 
- 2 mi 

- 9000 It, 9 mi, 19 gal 

-200011 

-9000 ft 

- 9 mi 

i 

cavu a lots or rust 

N 1 

ft ‘ 


U }Ogal 


•0 1*1 


I- (90 I*' 


Fig, 6: Working Values (Interval Scales) for the Basic Values 

of Surrounding Parameter (left) and for Individual 
Parameter (right) according to Estimates made by 
Experienced Pilots (Yntema, Klem \10] ) 

For the three-dimensional case in point in which three inter- 
polated surrounding parameter are taken into consideration, the 
expected usefulness is calculated by the equation: 

EU(x ? y,z) » A + Bx +■ Cy + I)z + Exy + Pyz + Gxz + Hxyz 

The constants A to H are selected in 3uch a manner that the values 
calculated for the "basic combinations" are in agreement with those 
of the basic value scale (Pig. 6, left). 

In a certain decision making situation the expected useful- 
ness of each of the two alternative airports is determined by 
the computer corresponding to the above equation and then that 
one iii seleoted with the greater expected usefulness. 
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When the decisions determined by the computer in this manner 
are compared with the decisions of pilots, it is demonstrated that 
the differences between individual pilots are greater than between 
any single pilot and the computer, The algorithm in therefore - 
on the average - a better model for decision making behavior of 
one pilot than any randomly chosen other pilot. The decisions of 
the algorithm are fully consistent. Pilot and computer algorithm 
deviate from one another merely in approx, 4 A of all oases. The 
application of a decision making aid with suoh a small error quo- /21 

tient could be helpful in oases in which any deoision would be 
better than none, bnder suoh conditions the computer could be 
instructed to wait for a period of time until it takes an action 
appropriate for the operator, if the operator shows no reaotion. 

The interval stales in Pig, 6 are determined previously and 
usually not in genuine situations. The decision making strategy 
of the computer based on these scales is therefore fixed and not 
capable of adaptations, A continuously adjustment to new know- 
ledge and information or a ohange of strategy is not possible with 
this procedure. 

Computer algorithms oapable of adapting, adjusting their decision 
making behavior continuously to the situation and to the strategy 
preferred by the operator, are discussed in the next seotion. 

4 .3 Dynamic Deoision Making Aids Capable of Adaptation 

In a deoision making situation, characterized by various boun- 
dary conditions and consequences, there is as a rule a series of 
various situations leading to the same decision. These may then 
be categorized in classes in which each situation is characterized 
by a typical structure of certain parameter, i.e. by a characteris- 
tics vector. Division of classes changes with decision maker changes 
in strategy. 

When the decision making analysis is to be transmitted to 
the computer, the necessity arises for automatically carrying out 
classification of characteristics vectors. In addition a contin- 
uous check is to be made whether the classification determined 
adequately reproduces the desired decision making behavior. Where 
necessary, the classes are to be adjusted to the altered boundary 
conditions adaptively. 

There are numberous mathematical approaches to the problem 
of classification of characteristics. Solid introductions to 
these questions may be found in Nilsson [11J , Duda and Hart [12) , 

Young and Calvert [13] a3 well as Pu [14] , 

4,3,1 Classification of Characteristics Vectors /22 

The function of a relatively simple classifier is to be ex- 
plained here as far as this is required for understanding following 
examples. In principle we are dealing with the known Steinbuch 
learning matrix, the design of which is given in Pig, 7. This ar- 
rangement examines a series of characteristic vectors X and places 
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them in two categories (+1,-1), Training of this learning matrix 
oonsists in determining the weight vector W in such a manner that 
eaoh veotor X is placed in the oorreot (desired) class. 
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Pig, 7: Classifier for 2 Categories with Linear differentiation 

Function (Learning Matrix) according to Nilsson [1 1j 


Key: 


a, expanded characteristics vector 

b, characteristics vector 

c, weights 

d, differentiation function 

e, summation instrument 

f, threshhold element 

g, answer 


Placement in a category is decided by the differentiation 
function g(X), resulting from multiplication of weight factor W 
with the characteristics veotor expanded by "+1 M , as demonstrated 
in Fig, 7. 


In training the learning matrix the actual placement of a 
characteristics veotor is compared with the desired placement* 
When placement is correct, no correction is made and the next 
characteristics vector is examined. In the case of an erroneous 
placement a correction of weight vector is made according to the 
following rule: 

W’ » W + c.Y when placement is made to category 2, but place- 
ment to category 1 (+1 ) is desired, 

W 1 » W - c.Y in the opposite case. 


For selection of value of c there are various procedures, con- /23 

verging at different rates, but which lead to the same result. 

For linear differentiation functions g(X) (only this simplified 
case will be taken into consideration here) this training supplies 
a solution after a finite number of steps in every case. If the 
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categories may not be separated In a linear manner, there Is a re- 
mainder error (Slagle [15)). In this oase the training may also 
be discontinued after a finite number of steps when the remainder 
error has reaohed the desired size. 

The following example (of Nilsson [ll] ) serves to demonstrate 
training procedure for learning matrix: 

eight three-dimensional veotors are to be plaoed in the categories 
-*-1,-1 in the required manner (Table 2). The object of the searoh 
is a weight veotor W whioh ensures that the learning matrix oan 
perform this placement. 



Expansion desired 
Placement 


Veotor No 


Components 


Table 2: Characteristics Veotors and Desired Placement [1 1] 

The course of training for a correction procedure with fixed 
increment c ■ 1 is shown in the Table below. The learning process 
is completed, as oan be seen, after 29 patterns have been presented 
and after 13 corrections in weight veotor have been made. 


Key: 

a. Expanded 
characteristics 
vector Y 

b. Weights 

o. Threshhold 
element 

d. Answer 
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Table 3: Course of training for the example explained in the 

text according to a correction procedure with fixed 
increment c ■ 1 til] 

Key: a. expanded characteristics veotor 

b. weight vector 

c. placement 

d. desired placement 

e. correction ? 

f. new weight veotor 

The 2 category-classifier described carries out the most 
simple possible placement tasks. Normally the task consists in 
sorting all patterns of a group X in such a manner that these are 
placed in only one category and not two. For this purpose an ex- 
panded learning matrix is applied with the structure shown in 
Fig, 9. In this several elementary learning matrices can be dis- 
cerned with exits additionally running through an maximum selector. 
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Fig. 9: Classifier for 1 Category with Linear Differentiation 

Function fll] 

Key: a. Characteristics veotors 

b. Placement 

o. Differentiation functions 

d. Summation instrument 

e. Weights 

f. Discriminators 

The placement of a characteristics veotor Y, in the oategory i 
is correct when the differentiation function g^T) for this vector 

beoomes greater than all other g^Y). In case any other weight 

function should beoome greater, the pattern is plaoed in an errone- 
ous class, with the result that the weight vectors W^ and have 

to be altered in pairs. For the correction the already known 
equations are applied (Wj » + o.Y or - Wj - o.Y). 


20 





4.3.2 Example for Classification of Decision Making Sltuatlona /26 

For matters of simplicity the example used in Section 3.2 for 
demonstrating revision of probabilities "submarine location" is 
further developed here. It oan be seen from tl. representation 
of probability matrices for position "submarine" and "whale" (com- 
pare Table 1, p. 9), that the taotioal situation on whloh a deci- 
sion is based is described in each case by an eight-dimensional 
characteristics vector F; 



This charaoteristios veotor may assume any number of values in the 
course of time. On the other hand only 8 different aotions £1 - 
£8 may be chosen: 

.El : Search with helicopter in quadrant 1 

£2 : Searoh with helicopter in quadrant 2 

E3 : Searoh with helicopter in quadrant 3 

£4 : Search with helicopter in quadrant 4 

£5 : Searoh with sonar buoy in quadrant 1 

£6 : Searoh with sonar buoy in quadrant 2 

£7 : Search with sonar buoy in quadrant 3 

£8 j Searoh with sonar buoy in quadrant 4 

The task of an operator consists in making one of eight possible 
decisions after observation a certain situation characterised by 
a charaoteristios vector P. On the basis of his experience he will 
chose solution £,, which holds promise for him of the greatest 
( reoonnoissance) 1 usefulness EU,, . Corresponding to Section 4.1, 
p. 14 - *1 


“V - P(H U ) X £U u1 ♦ * p(H w ) x EU 

II 4 4 

The operator therefore carries cut a classification task as was 
dealt with in the previous section: classification of patterns 

in eight categories. 

Notice that the differentiation function g,(X) is constructed 
in the same manner as the equation for the total expected useful- 
ness of a decision under conditions of uncertainty: 

d / 22 

9 " X w i **i + ' v d+ 1 

l 


The arrangement demonstrated in Fig. 9 may therefore be inserted 
as adaptive classifier capable of being trained. 
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The meaning of an expected usefulness EU, as is subjectively 
estimated by operator (16J, may be attributed to the individual 
weights. Continuously deoisions of the operator aro observed for 
classifier training and assumed as desired values (desired place- 
ment ). If no agreement oan be made, a correction of weight factors, 
as described in the previous section, is made. When all tasks have 
been learned, the olassifier makes the deoision which the operator 
would also make. This arrangement therefore does not supply abso- 
lute optimal solutions but merely copies the operator behavior, 
lnoluding his mistakes. 



Fig. 10: Classifier for the Example 

"Submarine Looation." See 
Text for Details (altered 
according to [16] ) 


EU. , ■ Subjective eatima- 
tion of usefulness 
resulting in this 
situation from the 
information on quan- 
tity of p(H ). 

EH m H n * 10 

Ei Expeoted usefulness 

of deoision Si, 

Key: Note: All "EN" in this 

figure are understood 
as "EU". 

a. Choioe of deoision 

with EU mnv 
max 

b. Deoision of classifier, 
o. Comparison of deoisions. 

d. Decision of operator. 

e. Training/correotion. 


4.3.3 Characteristics of Adaptive Usefulness Estimation /28 


A special advantage of the method described is that usefulness 
estimation of the operator is determined here dynamically in a 
genuine situation and actual time. The arrangement is thus capable 
of reacting continuously to alterations in strategy or decision 
style of an operator, dependent on growing experience, lessening 
concentration or alteration of boundary conditions of problem, 
among other things. 

Experiments show that thio training rapidly leads to a lessen- 
ing in error quotient under constant boundary conditions when the 
operator maintains consistent behavior. In general a stabile con- 
dition is achieved after a few decision cycles. 
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During the learning process two typical phases ore differ- 
entiated (Fig. 11): In the beginning a rough adjustment occurs 

with a oorreotion of weight factors after each decision, Subse- 
quently corrections only ooour after several decision cycles in 
each case. This second phase therefore represents a fine tuning 
and an adjustment to scalier variations in operator behavior. 



Fig, 11: Number of Corrections of Usefulness Estimated Values 

as a Function of Deoisici* Cycles [17] 

Key: a. Number of Corrections per Decision Cycle 

b. Number of Decision Cycles 

As soon as the classifier training has achieved the phase of 
fine tuning, and this is the case after a few cycles, most decisions 
of the operator are correotly predicted. When the strategy of the 
operator changes, a new adjustment process of classifier begins, in hi 
which the learning matrix takes some new information more strongly 
into consideration than others and reacts in a manner similar to the 
human memory, 

A normative significance may be attributed to the learned, es- 
timated values of usefulness. The deoi3ion predictions of classi- 
fier based on these may be indicated to operator, who then employes 
them as decision-making aids or for the control of his decision- 
making behavior, whereby a closed circuit control is created as 
is demonstrated in Fig, 12, 

4.3.4 The AODAK System 1 ^ 

The example M submarine location” in Section 4.3,2 was inten- 
tionally simplified in order to show the adaptive usefulness es- 
timation better. Freedy et al. [18] conducted studies with a much 
more complex reconnoissance system, a relatively realistic experi- 
mental situation in which various decision aids may be tested in an 
actual time situation. 

1 ^ AD JAM » Adaptive Dynamic Decision Aiding Methodology 


23 



Fig. 12: Deoision Aids and Controls on the Basis of Adaptive 

Usefulness Estimation in form of Closed Cirouit Control [18J 

Key: a, characteristics veotcr b. display 

(decision making situation 

o, operator d. deoision 

e. deoision of classifier f. adaptive pattern - 

olassifier 

The reconnoissanoe situation seleoted is shown in Fig* 13* 

The position and direction of an enemy submarine and a ceoond 
neutral object (whale) in an area of the ocean covering lb seotors 
is to be determined. For reoonnoissanoe a total of eight different 
sensor types may be positioned, in eaoh sector, however, only one* 

The sensors differ from one another in sensitivity, reliability and /30 
in costs. One sensor is able to oearoh the entire sector in whioh 
it was positioned in eaoh case. Position reports are not transmitted 
continuously, but in intervals of 15 minutes. 

Operator performance in this recnnnoissanoe task is measured 
by means of an index taking into oonu^deration oorreot and incorrect 
position reports as well as oost for the sensors deployed* This 
index is indicated to the operator and is the basis for reward at 
the end of the trial. 

Aim of the experiments oonduoted was to test effeot of adap- 
tive deoision aids on operator performance (test of ABBAM). The 
system contains two intelligent deoision aids - 

1 , Help in gathering information - 

After each new report of deployed sensors a list of 
revised probabilities for target positions is processed 
and displayed (Bayes* Rule), 

2, Help in selection of action (placement of sensors) - /31 

The computer determines expeoted usefulness of placement 

of a certain sensor in a certain seotor for the operator 
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CV Aircraft Carrier 


S Enemy Submarine 






Fig. 13: The Situation Employed for Experimental Examination 

of ADDAM [1QJ 

on the basis of adaptive usefulness estimation. 

Plaoement suggestions may be taken from a list of priorities 
for expeoted usefulness prepared by the oomputer. Suggestion 
for action by the oomputer represents the sensor plaoement 
with greatest expeoted usefulness. It is left to the dis- 
ore tion of the operator to deviate from this plan. 

A blook diagram containing both desoribed oomputer aids is 
givon' sohematioally in Fig. 14. 

Experiments demonstrate that the adaptive model already pre- 
diots the deoisions of operator with a great degree of aocuracy 
after a short period of time (correlation 82 £). Quality of deoi- 
sions measured in the above indioated index is doubled. With oom- 

g uter aids higher sensor oosts may be inourred, but the number of 
its increases and there are fewer mistaken reports. The system 
is gladly aooepted by the experimental personal and regarded as 
intelligent interactive partner after some praotioe. It may be 
decisive in this prooess that the trial personnel is informed about 
how suggestions of the oomputer result and that this is substanti- 
ally a copying of their own behavior. 

Operator deoisions are more consistent with the aid of the , 
computer, the Interindividual variations are also reduoed. In 
addition it is also observed that the deoision oyole runs a more 
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Fig. 14s The ADDAM System [18] 

Key: a. Gaining information 

0. Recommendation for action 
e. Revised hypothesis 

g . New data 

1. Operator 

k. Surroundings 
m. Result of action 


b. Probability revision 
d. Model for expected usefulness 
f. A priori hypothesis 
h. Display 

j . Selection of action 
1. Result 
n. Time delay 


rapid course due to computer assumption of bookkeeping and sorting. 
Computer bookkeeping is a valuable support for human memory and con- 
centration. 


It is concluded from the experiments that the system desoribed 
is especially suitable for tasks in wliioh general guidelines may be 
given but the individual action is left to the operator. 

It is left to remark on the value of the measures suggested by 
the computer. The rank series provided by the computer corresponding 
to expected usefulness typically shows that the first 3-5 sugges- 
tions* in the list have a substantially higher EU value than the 
subsequent remainder. This result in various decision cycles and 
trial personnel. Since the placement suggestions of lesser priority 
have a lov/er expected usefulness but cost more, these suggestions 
may be suppressed by the computer in order to keep less helpful in- 
formation from the operator who is already under great stress. 


4 .5.5 Task Distribution Man-Computer on the Basis of Adaptive 
"usefulness Estimation 

Systems containing intelligent adaptive components capable of 
learning will be able to take over many tasks in the future which 
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must now be carried ova by the operator. Typical design is given 
in Pig. 15 in whioh parallel to the conventional man-man control 
circuit a second loop containing an adaptive oomputer is shown. 
This learns by means of observation of system behavior and is in 
a trained condition oapable of taking action or giving recommenda- 
tions on the basis of stored experience. Systems using this con- 
cept are already being applied. Typical examples are remote-con- 
trolled missiles, intelligent rookets, teleoperatcrs and robots. 


Fig. 15: System with adaptive Components capable of learning [ 19 ] 

Application occurs in all these cases in non-deterministic 
situations characterized by a risk attached to all actions and 
decisions. This is valid for human aotions as well a3 for those 
suggested by computers. 

The operator is assisted by a system generating its own be- 
havior patterns in time and giving suggestions which have only a 
certain degree of confidence. 

The success of such an arrangement depends on whether the 
computer can win the operator's confidence by means of correct 
prognoses. When the operator realizes that the computer has 
learned enough to be a reliable aid, he will delegate an increas- 
ing amount of decision making tasks. 

An example for an adaptive aid system capable of learning 
is indicated by Freedy et al., 1975 [19,21] and by Crooks et al, 
1974 [203 (ACS - autonomous control system). This is a system 
applied for learning a matrix of dependent probabilities. The 
matrix permits prognoses via observation of present conditions 
and past system conditions. The probability of a correct progno- 
sis increases with number of observations. The system decides in 
each case in favor of the action alternative with greatest pro- 
bability of sucoess. 




The manner of function of this system is explained here using / 34 
a greatly simplified example (Pig. 16;. The task consists in assis- 
ting a missile to a avoid an obstacle, and resulting collision. The 
task difficulty results from the threat probability represented by 
the obstacle, a non-linear function of distance from the missile 
(Fig. 17). Threat profile is known to the operator because of pre- 
vious instruction. On the basis of this knowledge he attempts to 
place the missile on a path of most security, Depending on where 
the obstaole may appear he can select from 10 possible aotion al- 
ternatives (possible linear flight paths). 
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Pig. 16: Description of the 
Task applied by 
Crooks and Preedy 
[19,20] 

i ■ Obstaole path 
1 » Missile path 
D ■ (i-j) Distance 
between obstacle 
and missile 


Pig. 17: Throat Profile of an 
Obstacle ll9 t 20] 

Key: a. Hit probability 

b. Distance of obstaole 
to the missile 


Key: a. Obstaole 
b. Missile 


The computer observes actions of the operator and resulting /35 
consequences (hit/no hit). All possible combinations for the 
path i of obstacle and j of missilo are a priori equally pro- 
mising of success for the computer, expressed in an internally 
stored probability matrix (Pig. 18;, showing in all fields above 
the value 0.1. The matrix is first untrained. 
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Pig. 18s Matrix for the Probability 


of a Hit 


when the 


Obstacle is on Prth 1 and the Missile on Path ;) 


Key: a. Path j selected for missile 

b. Path i observed, of the obstaole 

c. Posterior 

d. Probability for a hit 

For the situation shown i ■ B, J ■ 5, a hit probability of 
70 ^ may be read from Fig. 17 in the case of a hit report. With 
*'ts additional information the revised probabilities noted in 
the lower portion of column 8 in the matrix elements result (com- 
pare Seotion 3.1). It can be seen that for the next trial with 
this 8-5 configuration 9 alternatives are still equally promis- 
ing of success ( p-0.09). The learning process is continued with 
each observation. In time there is then for each configuration 
raissile/cbstaole a path j of relatively low hit probability. That 
is the path which the computer v/ould ohose. Reliability of this 
ohoioe depends on the absolute value of corresponding probability. 

The p ^-matrix accumulates all information transmitted continuously 
in this manner. It continually becomes more clever in this manner 
and forgets nothing. When boundary conditions (threat profiles) /J>6 
remain constant, it performs better than the operator after a de- 
fined point. 



While the examples described may still be readily surveyed, 
the relationships studied by Crooks and Freedy using three obstacles 
simultaneously is substantially more complex. 

For task distribution between operator and computer there 
first exists the possibility of determining a defined level of 
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confidence. As soon as this level is exceeded the computer auto- 
matically assumes control. It is clear that determination of this 
level is decisive for system effeotivity - when the level is low, 
oomputer deoisions are speculative and liable to be erroneous} 
for high levels the oomputer is applied only seldom. 

A second possibility consists in leaving the deoision about 
task distribution to the operator where necessary, 'l'he disadvan- 
tage of this possibility is that the operator usually does not 
carry out this distribution optimally beoause he underestimates 
usefulness of oomputer assistance and his own work load. The op- 
erator has the choice of one of two lotteries - 

1 • To act himself with a defined probability of suooess 

2, Delegation of decision to the computer wnich also pro- 
mises a certain probability of suooess. 


Successful Unsuccessful 


Lottery 1 


computer deoides 


Lottery 2; operator decides 


in each decision making situation the operator makes subjective 
estimation of usefulness of all four possible consequences, re- 
sulting in expected usefulness (EU) of the two lotteries: 


The operator will chose tile lottery with the greatest usefulness 
expectation for him. 

Tlie values EU^^-j and EU Lq+2 must be determined for an auto- / 

matic adaptive task distribution between man and computer. It may 
be recognized that this again is a problem in classification. 
Characteristics vector P - IPre p RM p OE p 0.’.v should be arranged in 
two categories. For this the usefulness parameter EN f en re en oe en 
(Note; read "EU” for "EN" ) are to be learned by observation of 
operator behavior (compare Section 4.3.1). The weight vector wnich 
actually has no meaning in the case of pattern classification obtains 


in this application case the significance of usefulness expected sub- 
jectively by the operator from a certain possible consequence. 

Where the characteristics vectors may be separated in a linear manner 
and when the operator acts in a completely consistent fashion, the 




training results in disore t EU values. In reality remainder errors 
are observed, caused by memory and Information processing mistakes 
of operator and by the faot that the patterns may not be separated 
in a linear manner. 

The desoribed method adaptive task distribution man/oomputer 
has proven itself in experiments. It may be shown that operator 
prt ferenoes are substantially influenced by oosts in connection 
with the ohoioe of a lottery. Usefulness estimation of oomputer 
decisions increases when oosts of operator deoisions increase 
and vioe versa. 

The indication of oonfidenoe level, i.e. of learning condition 
of classifier, intensifies the application frequency of automatic 
oontrol systems considerably. The operator then has a better poss- 
ibility for estimating the chances of success of the computer. 

5. Computer Assistance in Problem Solving /38 

5.1 Problems of the Type "Game against Nature 11 

The task of an operator in guidance and oontrol systems fre- 
quently consists in oonduoting a system from a known beginning con- 
dition to a desired target condition. During the transition vari- 
ous boundary conditions (game rules) have to be adhered to as a 
rule. For this problem type numerous examples may be listed from 
the area of oontrol technology, operations researoh, navigation, 
distribution of goods, procedure guidance eto. 

An example from naval navigation is explained more closely 
here especially suitable for the problem solving procedure sub- 
sequently described! 

A freighter ic navigating in an heavily trafficed water way. 

The ship represents a dynamic system with a condition vector 

described at time t by a differential equation [ 25 ] : 

d 5 

ir m a* »h 

In this equation £ (*' ^ describes the coordinates of looation 

and u (v,$) describes the steering vector including velocity 
and ‘bourse. 

Explicit equation for ship's motion: 

■^2- * v(») cat $(t) ; v M *ln$(t) 

at at 

For the motion of other ships in the waterway the analog equation 
is - 
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dll' /* • 9 k (i k ,a k , t ) 


With this formulation the problem may be defined as follows: ob- 
serve the traffio situation, i.e. the oosition and motion of other 
ships and determine a oontrol strategy u(t) based on this observation 
in suoh a way that your own ship is transported from a starting 
point ioto a target point . Boundary conditions in, this case 
are that a minimum distanos (Amin) must be maintained at all times /39 
(avoidance of collision) and that travel is kept to a minimum. 

Solution of this problem in closed form is possible (variation 
oaloulation) but difficult. Realisation is often not carried out 
beoause calculation may not be conducted in aotual time. 

5.1.1 Problem Representation as Decision Tree 

Alternatively to the closed mathematical treatment the problem 
may be solved by means of a systematic analysis of successive system 
conditions (dynamic programming) (24] • Procedural method in the case 
of dynamio programming consists in a step-wise and continuous genera- 
tion of new system conditions beginning with the starting condition. 
Boundary conditions and rules are to be observed in the transition 
to new system phases. Where the problem has a solution, the desired 
final condition will be found among the variants sooner or later. 

Once the target condition 4 a reaohed, the path to solution may be 
retraced to the starting condition. 

Graphs are suitable for representation of possible transitions 
between system conditions. The points of graph represent system 
phases, the connecting lines between the points are the operators 
conducting from one condition to another. The decision tree for a 
puzzle with 8 pieces is given as an example in Pig. 19 [25] • 

The game rules may be derived from the decision tree: shift 
the pieces in such a way that they come to lie in the correct order, 
compare the target point. 

A form is to be chosen for problem representation within the 
graph, economical as possible, i.e. fitting to the problem. Symbol 
chains, vectors, fields and lists are possible. 

Graphs are either represented explicitly in the form of tables /4Q 
or implicitly via an algorithm, indicating in what manner the points 
of the graph are to be developed. In each case the portion of the 
graph leading to target point must be made explicit. Algorithms 
for development of graphs contain non-deterministic distributions 
and branching. 

5.1.2 Searching the Decision Tree for a Solution 

Problem solving consists in searching the graph for possible 
paths to the target phase while observing certain criteria such as 
path length and time. For this search application may be made of 
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Pig. 19s Decision Tree for a Puzzle with 8 Pieoes 

Key: a. limit depth 

b. searoh "in breadth" 
o. point system phase 
d. starting point, 
beginning condition 


e. order of de/elopment of 
the tree 

f. target point 


either exhaustive ' or heuristic^' techniques. 

Complete procedures systematically searoh all branches of the 
graph until a solution is found or the computer capacity is exhausted 
For this purpose varying strategies may be applied. In the case of 
the searoh "in breadth" first all possible transitions are developed 
from one condition. An example for this is given in Pig. 19. In the 
case of the searoh "in depth" first a solution path is followed to a 
defined limit depth in the graph. Where no solution is found until 
then, the search is continued "further above" and again developed 
"in depth". With the two above named strategies paths through the 
graph are developed of equal length or equal cost. The parameter 
c(n if nj) designates the costs for the transition from one point n^ 

to a point n^ . The designation g(n) is chosen for the cost of the 

total path from beginning point s to the poinv u. 

With respect to the interactive solution of problems in man-machine 
systems the heuristio search methods have more significance than the 
exhaustive procedures. Heuristic information is employed in setting 
priorities with respect to the order in which points are to be devel- 
oped. Heuristic information concerns special assumptions on the 
problem to be solved, suitable for simplifying the solution searoh, 


Exhaustive » complete 

Heuristic » "serving the discovery" 



r 

II 



m 
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including intuitive assumptions, rules of thumb, generally valid 
principles and plausibility conclusions. 

The heuristic search for a solution therefore represents a 
combination of systematic and intuitive procedures. That point 
of the graph, leading with the greatest probability to the target 
point, is developed first. The various points which may be seleoted 
are arranged in a rank series by means of an evaluation function. 

The evaluation funotion f(n) indicates the oosts of a path of 
minimum oost leading through point n. The oosts consist of two com- 
ponents: 

f (n) - g(n) ♦ h(n) 

g(n) » Cost of the path of optimal oost from starting point to 
point n. This value may be determined from the graph 
developed to point n. 

h(n) ■ "heuristic function." Indioates the oosts of path of 

optimal oost from point n to the target point. This value 
is estimated on the basis of heuristic assumptions. 

In the oourse of an ordered search the point of the graph /42 

with the lowest f(n) is developed first in eaoh case. Aa an ex- 
ample for this procedure the 8 pieoe puzzle in Fig. 19 is to be 
again taken into consideration. It is apparent that not all system 
phases represented there promise the same suooess, reaoh the target. 

A heuristic funotion h(n) is being looked for, expressing this situ- 
ation. 

The simple heuristio funotion intuitively seleoted for this 
example under consideration of the game rules evaluates the number 
of falsely plaoed pieces at the point n. The value h(n) equals 4 
is calculated for the beginning point (Fig. 19) using this assumption. 

The oosts of path g(n) are in addition included in the evalu- 
ation function to point n. In the rase at hand the costs are set 

equal to number of transitions to point n. For the three points 

subsequent to starting point in Fig. 19 the evaluation functions 
6,4,6 (see Fig. 20) are oaloulated with this rule. The point with 

the lowest f(n) is further developed in eaoh oase. It can be seen 

in the deoision tree section in Fig. 20 that the heuristio has con- 
siderable reduced the search for a solution. 

A further example for the same game, but with a complicated LAI 
beginning situation, is represented is Fig. 21 • The evaluation 
funotion is defined intuitively in this oase as follows: 

f(n) • g(n) + h(n) , where h(n) - P(n) -h 3S(n) 

Here S(n) is the number of successive errors in game pieces, 
counted during a single circuit. 3(n) increases by "2" for each 
case in which the piece with correct number does not follow. Chips 
in the middle count "1". P(n) indicates the sum of distances which 
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starting 

point 



Evaluation function: f (n)-g(n)+h(n) 

g(n)-path length to point n 
h(n)-number of pieoes with false 


position (oosts of point n 
to target point) 


Number example for point no. 2: 
f (n) -1+3-4 


target 


point 


Fig. 20: Deoision Tree for Puzzle with 8 Pieoes Developed 

via "Ordered Searoh" [25} 


each piece is removed from "its" location. The procedure is made 
clear by means of a number example for the following configuration: 


rnr 


ui 


S(n) • 15 
Game pieue no. 


h(n) - 11+3 x 15 - 56 


1 

2 

3 

4 

5 

6 

7 

8 


Distance from desired 
position 

1 


1 


1 

1 

3 

0 

2 


Sum 


P(n) 


11 


Therefore it follows that 


h(n) - P(n) + 3 S(n) = 11 + 3 x 15 - 56 








-1M 
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Pig. 21: Decision Tree developed by ordered Search with the 

Evaluation Function f(n) = g(n) + P(n) + 3 S(n) [25l 

It is again clear here that the ordered search leads to a /44 
solution after a finite number of steps. There is, however, no 
assurance in this case that the solution path found has the lowest 
cost, as opposed to the blind, complete search. 

The costs of a solution usually include in addition to the 
cost of solution path, the search oosts. When these combined costs 
for method 1 are less than for method 2 it is said that method 1 
has more "heuristic power". The "heuristic power" is determined // t j 
by three factors: 1 — 
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- cost of solution path 

- number of points developed 

- amount of work in calculation of heuristio 
function h(n) 

An attempt is made to deal with problems entailing high oosts 
only in a few oases, so that in xhe oase of routine tasks low oosts 
are inourred. In many oases, however, calculation of oombined 
oosts is not possible, then only the intuitive oholoe of an heuris- 
tio approaoh remains. 

The procedure of dynamic programming desoribed may be employed 
for interactive solution of problems in man-machine systems in suoh 
a way that certain searoh strategies are put into the oomputer for 
which it then independently determines a suitable solution in eaoh 
oase. Solution alternatives and aotion consequences may be estimated 
in this way from the beginning and for various strategies, Decisive 
contribution of operator oonsists in the effective formulation of 
evaluation function so that desired strategy and solution approaoh 
are expressed oorreotly in thl3 formulation. In the case in which 
e.g. any solution without consideration of cost is sought, g(n) • 0 
may be inserted. In oases in which h(n) is not adequately selected, 
the danger then exists that no solution will be found. When g(n) 4 0, 
in time all parts of the graph (i.e, also in breadth) will be searched. 

Dynamic pro jamming for application in man-machine systems re- 
mains limited to suoh algorithms and problems which may be dealt with 
in approximately actual time. 

5.1.3 Ship Navigation as Example for a Multiple Step Decision Process 

In this section the solution for navigation ta*k defined in 
Section 5.1 is carried out with the above explained method of dyna- 
mic programming. As a first step the possible paths of the own 
ship from beginning to target point are formulated, resulting in tek 
a graph as in Fig. 22 if it is assumed that the ship moves forward 
in each.section in a linear manner and at constant velocity. The 
point x* in the graph represents possible ship positions on the 

way to the target. On each of the N levels there are a finite num- 
ber of possible positions k . The amount of possible system phases 
is therefore: 

1 k 

* = { * n ) mft n ■ 0-N 

n t» n 

The geometry of the graph is determined by the consideration 
that deviation from correct course is to be limited where avoidanoe 
manoeuvers are undertaken, Further determining factors are natural 
obstacles and computer capacity. The maximum ship velocity is 
chosen under consideration of waves and currents, permitting cal- 
culation of expected arrival times for all points of the graph. 

Costs for phase transitions are calculated with transition time. 
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Levels 

i — 

m 


Fig. 22: Graph with N levels for representing all possible 

paths of the ship from the from the position of 

departure to the target ( 23 ] 

Position and motion of other ships in the waterway ("targets") 
are known beoause of radar measurements and oan be calculated for 
oertain times. For simplification it will first be assumed here 
that the targets also move forward in the seotion in a linear manner 
as oan be expeoted from merohant marine ships. 

When considering Fig. 22 it is demonstrated that the graph 4 
is not a oomplete net. From a certain condition x not all phases 
of the leven n+1 may be aohieved. The plausible reason for this 
situation is again that the deviation from correct course cannot 
beoome randomly large. 

For avoidance of a collision a minimum distance between ship 
and target must be maintained at all times. In order to define 
this distance the targets are observed periodically. Observation 
interval is in this oase very much shorter than the transition be- 
tween two points on the graph. On the basis of the last observa- 
tion in eaoh case a range is calculated for er.oh target which oan 
ba aohieved before the next observation is taken. If this calcu- 
lation is made on the basis of motion equations of targets, per- 
missible steering vectors q(v, i) and measuring error of radar. 

All non-permissible areas comprise a contour variable in 
time, the penetration of which oould lead to a collision. The 
minimum distance ship/targets is calculated for all points and 
also for transition times between the points. Transitions not 
meeting the condition 6>6 m , n are disregarded. Note that after dis- 
covery of a collision danger in the transition to x* the entire 

path from start to that point may be eliminated. The original 
graph from Fig. 22 is greatly reduced by these calculations. 



For all remaining paths without danger of collision the en- 




tire costs (here travel time) are now determined from start to the 
target and the fastest path Is lndloated as solution to the naviga- 
tion problem. Fibres 23a. and b. show trajectories determined In 
this manner. Fig. 23 b. demonstrates the recommended course oorreo- 
tlon beoomlng necessary when three targets alter their oourse. 

Since there Is no assuranoe that the simplifying assumptions 
about ship movements are accurate at all times, after eaoh radar 
observation It must be examined whether deviation of the ship still 
remains within the seoure time toleranoe and whether the targets 
behave as predicted. Fig. 24 shows the program oourse for this 
penodio examination. 





Fig. 23 a. : Example for Navigation optimal in Time in the 
Presence of 6 Ships [25J 


r. 








Fig. 23 b. : Altered optimal Strategy after 3 Ships have 

changed Course [25] 

Note in this case that the operator is alarmed in the next 
section only in the case of immediate collision danger. In oases 
of conflict lying further in the future the computer first attempts 





to find a 8olution itaelf. 

The navigation prooedure described was developed for applica- 
tion in the Italian ship "Lloydiana" , This system constructs a 
graph of 22 levels and takes into consideration up to 40 targets. 
The radar searoh interval was 2-3 minutes in this oonorete case 
and oaloulation time was approx. 12 seoonds. 


own ship's and targets 

UPDATED II NEMATIC PARAMETERS 


ARRIVAL TIME TO THE NEXT 
STATE IS COMPUTED 


DELAY 

LESS THAN TOLER AW 


IS 

ANY TARGET OUT OP 
^PREDI^nONS^ 


OPTIMAL POLICY IS RATIFIED 


CHECK COLLISION DANGER ON 
ACTUAL AND SUBSEQUQJT STAGE 


OPTIMAL POLICY AND UPDATED 
TARGETS' SITUATION ARE S'lORED 


IMMEDIATE DANGER 


ALARM 

MANUAL OPERATION IS REQUESTED 


DANGEROUS 

INTERFERENCE 


CHANGE POLICY 
GO TO MAIN PROGRAM 


Plow Diagram for Actual Time Optimation of Ship's Path [25] 







Because the strategy may be ohanged, the risk altered and /49 
alarm threshhold varied In a simple manner, this procedure offers 
considerable advantages for the interactive application in man- 
maohine systems. Other application possibilities are obvious. 

The computer-assisted optimation of air- traffic supervision and 
guidance should be mentioned. Work of air-traffio controllers 
could be substantially simplified if they receive suggestions 
from the oomputer on how holding loops could be avoided, which 
plane should accelerate for this and whioh should slow down etc. 

An example for this case is explained in the next seotion. 

5.1.4 Computer-Assisted Alr-Trafflo Control /bO 

When a piano flies over land according to instrument rules, 
it is handed from one path control to the next. Task of air- 
traffio controller is to separate planes within his range of 
competency and to hold them on the desired course. Numerous 
boundary conditions have to be observed in this process. When 
a plane does not adhere to one of these boundary conditions, a 
oonfliot arisep which must be resolved by corresponding directions 
from air-traffic controller to the pilot. 

There are usually many various possibilities for solving the 
conflict. The quality of air- traffic controller work may then be 
estimated by the number and type of commands which he gives to the 
pilot via radio and by travel time of the planes. 

This task i3 in times of heavy traffic very strenuous as is 
well known. It becomes especially difficult because expected 
events do not occur or not in the expeoted manner. Planes deviate 
from given course and height, do not maintain speed and new planes 
appear in the control range. 

The experienced air-traffic controller has developed internal 
models permitting him to make predictions because of actual observed 
situations. His experience also puts him in a position to test 
hypothetical actions in a shortened time measure with his internal 
models. For the controller learning means in this connection re- 
ducing the number of surprises with which he is confronted in real- 
ity by means of continuous improvement of internal models. 

It is possible with the method of dynamic heuristic programming 
to design a corresponding external decision model for assisting the 
controller. This model supplies action recommendations with exact 
time data on the basis of an heuristic calculation. The principle 
of implementation of this model is demonstrated with the following 
simplified example illustrated in Fig. 25 - on the basis of a 
traffic situation observed at 17:10 the further course of develop- 
ments in the case of boundary conditions assumed unchanged is cal- 
culated to 17:30. At 17:20 the picture given in the figure is 
received. Here radio fixed points (hexagons and + signs) and flight 
paths can be recognized. The broken lines mark the .limit of an in- 
coming control area. N111JD is a Lear Jet flying at 400 knots, 

SWAN-1 a military jet at 600 knots, N3953T a single motor Piper at 
200 knots and AA-21 a Boeing 707 of American Airlines at 400 knots. 
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All maohinea are flying at a height of 11 f OOO feet 


PLAN-AHEAD WORLD 
N111JD / 

.no Jr 


17 > 20:00 


Flight paths 


SEPARATION CONFLICT: N111JD SWAN-1 

SEPARATION CONFLICT: W3953T AA-21 

AA-21 TOO HIGH FOR APPROACH HAND-OFF 
EXPECTED AT 17:30 


An Air-fraf f J o Situation calculated 10 Minutes in 
the Future a 11 ; 17:20:00 [ 26 ] 


In the range of path control the following three situations may 

occur, requiring sin action on the part of the controller: 

a. Distance between planes 1 and j must longitudinally amount to 
at least 5 miles, otherwise a difference in height of at least 
1 ,000 feet Is required. This conflict may be resolve by the 
following alternatives - 

- climb 1 ,000 feet 

- reduoe altitude by 1,000 feet 

- a turn to the right 

- a turn to the left 

- reduce speed 

- fly a holding pattern 

b. When the next radio fixed point is an incoming control station, 
instructions on altitude may not exceed 6,000 feet. Corresponding 
instructions: 

Reduce altitude to 6,000 feet or less, 

c. Planes should not deviate from desired course by more than 4 

4 miles. Possible instructions to pilot: 

- Return to desired flight path 

- Fly in the direction of the next radio fixed point 
if this is no further away than 40 miles 

- First wait and give no instructions 


It can be seen iri Fig. 25 that there is a separation conflict 
at 17:20 between N111JD and SWAN-1 as well as between N3952T and 
AA-21 . A conflict in altitude may further be predicted for AA-21 
at 17:30, since this plane then approaches incoming control range. 


Pig. 26 shows the solution of the separation oonfliot between 
Piper and Boeing 707 by means of a decision tree. Vertical lines 
signify here a course of events unaltered in time (vertical time 
axis), horizontal lines mark point in time and the type of altera- 
tion examined. 

It can first be seen in the middle of the figure that the se- 
paration conflict 1 is determined at 17:20, for solution of this 
conflict all logical actions are played through on the deoision tree, 
evaluating the final points again in the manner already explained. 

for example a possibility consists in reducing Piper speed 
from 200 knots to 150 knots, This action would have to be taken 
already at 17:10. The same effect may be achieved, when the Boeing 
707 reduces speed to 300 knots at 17:12. Both solutions, however, 
receive a low evaluation and are rejected. The complete develop- 
ment of decision tree supplies finally the optimal solution, setting 
the Piper on course 270 at 17:17. 


A similar treatment of the other two conflict oases finally 
supplies the following computer suggestion: 


17:17 N3953T 
17:19 N111JD 
17:24 AA-21 
17:25 N111JD 
17:28 N3953T 


Course correction to the right 45 degrees 

Reduce altitude to 10,000 feet 

Reduce altitude to 5,000 feet 

Reduoe altitude to 6,000 feet 

Course correction to the left 45 degrees. 



Pig. 26: Decision Tree for Avoidance of Separation Conflict 

between Plane N3953T and Plance AA-21 £6) 1 ) 


TT 


The numerical values at the final points have been determined 
here by means of a function not explained in detail here. 
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This suggestion may be employed by the air-traffio controller / 33 
for examining his own deoisions. A direct transmission to the 
pilot is also possible, 

A much more complex system of this type was studied experimentally 
at the University of Texas, comparing especially performance with that 
of air-traffio controllers. The computer model made better deoisions 
in 16 of 19 oases and in no oase worse them the controllers. In an 
espeoially diffioult situation the computer required only 15 instruc- 
tions for the pilot while the controller needed 40. 

It may be concluded that suoh systems could be advantageously 
applied to praotioal situations. Large problems , however, above 
all of a psychological nature, are to be expected for the pilots, 
air-traffio controllers and general publio. It still must be in- 
sured teohnioally that the computer recognizes in time when it oan 
not handle a problem and gives an alarm signal to air-traffio con- 
troller, before it is too late, 

5,1.5 Interactive Problem Solving via Problem Reduction / 5 4 

In many oase the problem to be solved permits so many varia- 
tions that a complete decision tree could not be developed and 
searched. Determination of solution by means of complete permuta- 
tion either takes too long or costs too much. Sometimes the task 
may be solved via division into partial problems (problem reduction). 

In this process the operator develops solution plans intuitively 
and delegates to the computer the solution of suoh partial problems 
as he deems necessary. 

While fully automatic systems confronted with unsolvable tasks 
often react in an unexpected way, the operator retains his grasp 
of the solution path. The user controls the actions of the compu- 
ter interactively so that the heuristic capabilities of the man 
and the algorithmic advantages of the comouter complement in an 
optimal manner. 

Tasks especially suitable for this type of problem reduction 
may be represented graphically. In such cases the unsurpassed 
capability of man to classify patterns is put to use. 

As an example an extensive problem of the type "Travelling 
Salesman" is considered. The task consists in visiting 68 cities 
one after the other in such a way that the total travel distance 
is kept to a minimum. Optimal solutions of this task cannot be 
found in a reasonable amount of time employing decision trees. 

Solutions applying heuristic approaches lead to jolutions not com- 
pletely optimal and take up approx. 22 minutes of computer time. 

For the solution of this task by means of problem reduction 
the geographical position of individual cities is first displayed 
on a screen. The operator develops intuitive solution plans after 
observing the situation. A first plan consists in compiling groups 
of points to "superoities" , drawing polygons around groups of cities 
with a light stylus, corresponding to the pattern classification 
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promising the greatest suooess to the operator* The sub-problem 
arising in oonneoting the super-oities in an optimal manner is 
delegated to the oomputer (Pig, 27 a.). 

After observation of point distribution a seoond possible 
approaoh consists in determining an optimal inner and outer tra- 
vel route which are then oonnected to form a total solution 
(Pig. 27 b.). 



a. ) Solution Plan 1 » 

Compilation of Adjacent Cities 
to Super-Cities 



b. 


Solution Plan 2: 

.Division of an Inner and 
Outer Route 



Pig. ?7: Demonstration of Interactive Problem Reduction which 

may be represented graphically (?7] 

The solution found in this manner (Pig. 27 c.) is better than that 
found by means of the above-mentioned heuristic approach. It takes 
only 6 minutes of computer time, occupying an operator, however, 
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lor approx. 05 minutes. Applications of the interactive problem /56 
reduction procedure explained for problems represented graphically 
are possible among other in navigation, distribution of goods, 
traffio problems, searching for mistakes. 

5 .2 Two Person Games 

As oompared to the game against nature one's own notions in a 
duel situation have to be calculated for the most unfavorable oon- 
sequenoe selected by the opponent instead of one of several possible 
consequences. Examples for this prooess evasive action, target aim- 
ing, taotioal situations and many other tasks occurring in guidance 
and control systems. 

Games may be divided into numerous categories 16,28) , Only 
methods for determining optimal strategy in two-person games using 
oomplete information are considered here. This game type is char- 
acterized by each player's knowledge of the game rules and the 
possible moves of the opponent (examples are checkers, chess, tic 
tac toe and go). 

A suitable formulation for representing a series of one's own 
and the opponent's moves are AND/OR graphs which may be considered 
an extension of the OR graphs (decision trees) introduced in Section 
5.1.1). The design and construction rules are expressed in Fig. 28: 




Fig. 28: Design of AND/OR Graphs [25l 

The left-hand figure symbolizes a problem A with a number of alter- 
native sub- problems. The figure demonstrates that the problem A 
is solved when either the sub- problems E and C or D and E or F are /57 
solved. H,M and F on the right-hand side are OR points (alterna- 
tive sub-problems ) . At least one of these must be solved. B and 
C in the right-hand figure are again AND points. These are marked 
by a connecting arch. 

For the representation of a game situation in an AND/OR graph 
the following notations are used for the two players: 
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PLUS (♦) 

MINUS (-) 

X Game configuration 

X + PLUS mo vo 8 first in this configuration 

W(X+) PLUS oan win on the basia of situation X 

The OR points represent the moves PLUS (the player seleots a 
move from several alternatives). The AND points represent the moves 
of MINUS (opponent). The opponent may make many moves and the con- 
sequences of all these moves must be examined. 

In the representation of the game OR (one's own moves) and 
AND points (opponent's moves) alternate with one another. The task 
now consists in searoh the AND/OR graph for a sequenoe of moves for 
PLUS, insuring that the game may be won from thi3 starting situation 
or may at least remain a tie. 

Blind search techniques seldom lead to a solution, because the 
graphs for many games become to large. The complete representation 
of a eheoker would have, e.g. , 10 4U points. One alternative consists 
again in limiting the design of graphs and the searoh for a solution 
with the aid of heuristic assumptions. This procedure does not guar- 
antee finding the best move, but at least a good first move is found. 
Then the reaction of the opponent is considered and a good move is 
then determined on the basis of the new situation. AND/OR graphs 
demand another type of heuristio search technique than OR graphs. 

One of these procedures is described in the following section. 


5.2.1 






r/m 


The MINIMAX method for searching game trees is based on the 
consideration of the most unfavorable reaction of opponent in the 
choioe of a move. That move is made which results in the relatively 
smallest of all possible losses. The maximum expected loss is mini- 
mized. 


The method is demonstrated with the aid of a tic tac toe game. 
Here three marks are to be made in a row - horizontally, vertically 
or diagonally - in a field of 3x3 fields. The player who first 
achieves this wins. 


First a game tree for a move of player PLUS (marked by x) and 
an opponent move of the player MINUS (marked by o) is developed. 
Symmetrical moves are disregarded and for the game situations re- 
sulting at the end points a value e(p) is determined on the basis 
of heuristic assumptions. In the example described here the follow- 
ing rules apply for the formation of evaluation function e(p): 
e(p) evaluates the number of still open rows and columns for PLUS 
in a certain game situation subtracting the still open rows and 
columns for MINUS. An example: 
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v 




Game Situation 


Evaluation 




i 


o 


a 


e (p) - 6 - 4 - 2 


e(p) ie set at -oo when for the next move a viotory for MINUS re- 
sults and +<i o in the opposite oaae, 

Tha course of the game resulting via this heuristic evaluation 
may be followed for three moves of PLUS in Figures 29 and 50. The 
searoh depth after a good move amounts in eaoh case to 2, i.e. one 
move and the next move of the opponent. 


Searoh depth - 2 

AND 0 m 



Fig, 29: Demonstration of MINIMAX Procedure using the Example 

of a Tic Tac Toe Game [25] 

Key: a. Starting point c. Best move 

b. Value calculated from results d. AND 

The evaluation function takea into consideration the value of /59 
one’s own move as well as the (negative) moves of the opponent. 
e(p) is smallest when the opponent reacts in the moat unfavorable 
manner. One’s own move is selected by means of the relatively 
greatest value calculated from the final results (greatest minimum). 

It can be seen from Figures 29 and 30 that the graph was devel- 
oped completely, i.e. with all branches to the limit depth given, 
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Demonstration of MINIMAX Procedure usin£ the Example 
of a Tio Tao Toe Game [25J 

Actual move of opponent 
Starting Point 
Best Move 

Calculation unnecessary 


r. • 

W&Km ' * 

* |fl 

wKk • IB 

Upl * 

L <3 


Hj ■ 


yi' 




||lp§fi | « 





before the evaluation of final points and the seardh for a solution 
begins. The work oan be considerably reduoed by oaloulating the 
evaluation during development of the graph and then using these oal- 
oulations for oontrol of further development of the graph, Fig, 31 
shows this modified prooedure. Differences become olear in a com- 
parison with Fig, 29, Point B does not need to be developed further* 
beoause the first opponent move examined supplies a value H -1 M , not 
better than the already present point A. Further moves after point B 
would either supply better values, not taken into consideration /61 

corresponding to MINIMAX method, or worse values leading back again 
to point A, 

In English literature the designation ALPHA values is employed 
for evaluation of AND points and BETA values for the values oaloulated 
from final results of the OR points. This is therefore also termed 
an ALPHA-BETA graph reduction. 



Fig. 31 : Demonstration of ALPHA-BETA Development of a Game Tree 

(Nilsson 71) 

ALPHA Values may only be replaced by greater values, 

BETA Values only be lower values [25] 

Key: a. Alpha value f, this branch is out off 

b. Starting point g, this value may only be 

0 , may only beoome greater replaced by smaller ones 

d. Beta value 

e, may only be replaced by 
a neater value 

Fig, 32 provides an opportunity for following the ALPHA- BETA 
search procedure in the case of a more complex example. The graph 
shown is developed to a depth of three moves and the value for in- 
dividual final points is indicated. The bolder type branches are 
those which are to be searched when applying the ALPHA-BETA procedure. 
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Searching the thinly drawn branches le disregarded. 

The examples explained here were kept Intentionally simple and 
dear. Ordinarily substantially mors oomplex approaches are neoessary 
for evaluation functions. It may even oocur that several moves of 
equal value are determined. Por the seleotlon of a oertaln moves /62 
additional oriteria then beoome neoessary. 


a 



Pig. 32: Example for Illustrating the ALPHA-BETA Search 

Prooedure (Nilsson 11 ) 


Key: a. Starting situation 

b. First Move 

c. Opponent 

d. Second Move 

e. Third Move 


f. AND 

OH 

g. Points 

g. This path leads to the 
largest e(p) 


A further remark: when both players searoh "equally deep" in 
the graph, the opponent then employs one level deeper than PLUS 
in predicting his move in eaoh case. MINUS as a result also plays 
another "worst expected" move than that calculated by PLUS. 


5.2 Procedure for the Problem "Missile Defense" In the Form of 
a Two-Person Game 

In this seotion the problem of warding off a missile attack 
is interpreted as a two-person game. Actions of opponent and one's 
own defense measures are represented in a graph and the adaptive 
decision aids for defender are derived from this in actual time. 
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The taotioal situation may bs described as follows [29] : 

Missile silos S i are proteoted by interoeptor rookets I i# Each inter- 
oeptor rocket pfoteots several silos dependent on their^positions, / 63 

The silos are attaoked by the opponents rookets R 1 flying in ouo- 
cession. Because of the radar range only one attacker after the 
other may be discovered. Defense task consists in applying the 
available interoeptor rookets in the most effeotlve manner possible 
for protection of the silos. The oonoreto, greatly simplified game 
situation considered here as an example is shown in fig. 33: 



Possible moves of the opponent consist in attacking either 
silo or silo S 2 . The defense may react to this by defending 

the silo under attack with an interoeptor rocket or surrendoring 
the silo, i.e. saving the rockets until further attacks occur 
(silos which have been destroyed require no further defense). 

It i3 now assumed for further development of the example that 
an attack on S 1 is discovered from radar observation (first move 
of the opponent). Possible reactions are either to destroy R 1 
(assumed hit probability ■ 1 ) or to do nothing (x), i.e. to 
surrender S., . In order to determine the ”oorreot n move of these 
two, a graph is developed with all possible future opponent re- 
actions and defense measures. Pig. 34 shows this game tree for 
three successive attack and defense actions. 

As the next step the final points of the graph are evaluated /64 
heuristioally. In formulating the heuristio function the fallowing 
parameters may be taken into consideration: 

- S Number of remaining silos 

- R Estimated number of remaining attack rockets 
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Fig, 34: Graph for three suooessive Attaok and Defense Actions [29l 

Key: a. A-attaok 

V-dtfenae 

b. Possible Attaok/ Defense Aotions 
o. Known Attaok 


- I Number of remaining interoeptor rooketa 

- C Total oroteotion of all interoeptor rookets 

(sum or the number of silos covered by eaoh I, 
redundancies are also oounted) 

- S T Sum of the silos at the beginning 

- Rm Number of opponent rookets at the beginning 

(estimated) 

• k Portion of silos to be saved (k - 0 ••• 1), 

Here are several examples for expressing different defense 

strategies in heuristio function: 

h ■ S ; Saving the silos is more Important than all 

other considerations, 

h ■ 2S+C ; Saving the silos is more important than the 

remaining entire protection. 

h ■ ISC/R ; Number of silos saved, remaining interceptor 
rockets, remaining protection and remaining 
attack rockets are taken equally into consider- 
ation, 

Rm(S-kSm) 

h ■ g ■ / ^ vr ' — * silos should remain after the battle. 


For the present example h ■ 2S+C is selected, resulting in the 
starting situation shown in Fig, 33, h-2x2 + 3«7. 

For determination of a '’good first move” the final points 
of the graph in Fig, 34 are now evaluated. With these values the 



optimal defense move may be determined aooording to the MINIMAX 
method explained in the previous section and by means of applioa^ 
tion of Alpha-Beta reduction. In the present oaae the largest 
value results for the alternative x (do nothing, surrender silo) 
with h - 2. Employing the heuristio funotion (strategy) selected 
the reoommendation is therefore presented to surrender S 1 • 

In the meantime radar has registered an attaok on S 2 (seoond 
opponent move). A decision tree is developed from the beginning 
situation given now to a dep^h defined by oomputer time and oapa- 
oity (contained in Fig. 34). As defense move an interception is 
now xeoommended - whether with I. or L, This procedure may be 
continued in this manner. 

Experimental studies show that the same defense effeotivity 
is achieved with only 60 £ of the interoeptor rookets otherwise 
required, when applying this decision aid. It is assumed in addi- 
tion that the opponent is always as well informed as the defender. 
This is not normally the oase, so that in an aotual situation even 
better results may be expeoted [34] . 

The most important advantage of decision aid described is 
that the computer suggestions are adjusted adaptively to changing 
surrounding conditions. Information gathered during the duel and 
council of experts as well as a ohange in strategy may be continu- 
ously taken into aooount in formation of heuristio funotion. By 
this means the operator receives a valuable aid in decision-making 
situations in whioh he would otherwise bo dependent on intuitive 
solutions. 

6. Final Remarks /66 

In this report various possibilities were studied for reducing 
the cognitive stress of operator in complex man-machine systems via 
application of computers for intelligence assistance. Nickerson (30) 
states already in 1977, 

"Some system builders have envisioned systems that will have 
the ability to develop very sophisticated models of their 
users. Such systems would presumably be able to serve as 
extensions of their users' memories, to implement their 
problem-solving and decision-making capabilities, and gener- 
ally to augment their cognitive performance in a variety of 
ways. But this is speculation ...". 

This statement seems somewhat too conservative. The examples 
employed in this report underline the fact that the development 
in this area ha3 already led to results whioh are no longer desig- 
nated as speculation, although this compilation is in no way com- 
plete . The examples were also greatly simplified intentionally. 

There are a number of other approaches in the area of artificial 
intelligence with possible application for this research not taken 
into consideration at all here. 

It must, however, be recognized that the development of an 
"intelligent partner relationship" between man and computer is 
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just beginning. In addition it must be stated that tnis approach 
probably be employed only for exactly defined cognitive tu3ke, 

As (the only?) alternative to the increasing automation in 
oomplex teohnioal systems this approach, however, deserves consider- 
ably more attention than it has reoeived up to the present. The 
possibility for substantially increasing cognitive performance of 
man without increasing stress and without turning to full automa- 
tion in opening here. 
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